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Abstract 

The Etherphone™ system has been developed to 
explore methods for extending existing multi-media office 
environments with the facilities needed to handle the 
transmission, storage, and manipulation of voice. Based on a 
hardware architecture that uses microprocessor-controlled 
telephones to transmit voice over an Ethernet that also 
supports a voice file server and a voice synthesis server, this 
system has been used fpr applications such as directory-based 
call placement, call logging, call filtering, and automatic call 
• forwarding; Voice mail, voice annotation of multi-media 
documents.' .voice editing using standard text editing 
fechniques. anS applications of synthetic voice use the 
Etherp hones. {or yoige transmission. Recent work has focused 
on the creation of a comprehensive voice system architecture, 
both to specify programming interfaces for custom uses of 
voice, and to specify the roles of different system 
components, so that equipment from multiple vendors could 
be integrated to provide sophisticated voice services. . 

1. Introduction 

Suppose Alexander Graham Bell had waited to invent 
the telephone ufttil personal workstations and distributed 
computing networks had been invented. What approach, 
would he take in introducing voice communications into the 
modern computing environment? It was an attempt to 
answer this question that led to the creation of a voice 
communications project within the Computer Science 
Laboratory at Xerox PARC. 



*An earlier version of this paper appeared as: D. Swinehart 
D. Terry, and P. Zellweger. An experimental environment 
for voice system development. IEEE Office Knowledge 
Engineering Newsletter /, 1. February 1987. 39-48. 



Stated more concretely, the project's aim was to extend 
our existing mulu-media office environment with the facilities 
needed to handle the transmission, storage, and manipulation 
of voice. We believed that it should be possible to deal, with 
voice as easily as we can manage text electronic mail, or 
images. The desired result was an integrated workstation that 
could satisfy nearly all of a user's communications and 
computing needs. 

A basic requirement was to provide conventional 
telephone facilities (so that casual users would not have to 
read a manual to make a phone call), but our goals went well 
beyond that We had observed that most enhanced voice 
communications facilities had been developed by designers of 
telephone systems. In contrast we wished to draw on our 
.experience as developers of personal information systems 
running on powerful workstations with graphical interfaces. 
We were convinced that the user would prefer to perform 
voice management functions using the power and 
convenience of workstation facilities such as on-screen 
menus, text editors, and comprehensive informational 
displays. 

These aims led us to explore two related research 
comains: 

• "Taming- the telephone": Despite an immense 
. investment in research and development over the last 
.110 years, the user interface and the functionality of 

* the telephone still leaves much to be desired. We 
contend that the personal workstation, combined with 
a telephone system whose characteristics we can 
control, make it possible to better match the behavior 
of the office telephone with the needs of its users. 
There are gains to be had in the placement of calls, the 
handling of incoming calls, and the capabilities 
available to telephone attendants (that is. switchboard 
operators, receptionists, and secretaries). 

• Taming the tape recorder": We also believe that 
workstation techniques for creating, editing, and 
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storing text or images can be modified to deal with 
digitally-recorded voice. Application areas such as 
electronic mail, document annotation, and dictation 
are candidates for improvement Speech synthesis and 
recognition devices can be added to provide 
translation between textual and spoken information. 
' These two sets of activities are clearly related. ? A carefully 
designed system can support novel applications of both live 
and recorded voice. * ' - JM 

In this overview we will describe the Etherphone 
systetf that we have developed and used to explore the 
integration of voice into a personal information environment. 
The folfewiiig sections sketch the present hardware 
architecture, -describe some of the more .compelling 
applicaiioas that have been built to exploit it and briefly 
explore the software *md systems issues that have surfaced.. 

2. Etherphone System Description 

in designing our prototype voice' system, we surveyed 
several possible hardware architectures; indudini extensions 
of our existing Centrex service or of a commercially available 
PA8X. Primarily because Centrex and PABX systems did 
not support programmable switching control, we concluded 
that the most effective .way to satisfy our needs was to 
construct our own transmission and switching; system . 
Ethernet -local-area networks which provide the data 
communications backbone supporting personal distributed 
computing at Xerox PARC have proven to be an effecuve 
medium for transmitting voice as well as data. 
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Our prototype voice system consists of the following 
types of components connected by Ethernets, as shown in 
Figure 1: 

Etherphories: telephone/speakerphone instruments, each 
including a microcomputer, encryption hardware, and an 



Ethernet controller. Etherphones digitize, packetize, and 
encrypt telephone-quality voice (64 Kbps, with silence 
suppression) and send it to each other directly over an 
Ethernet: they support conferencing by digitally 
summing the voice packets from other participants 
before converting to analog. Etherphone software is 
written in C The current environment contains 
approximately 50 Etherphones. which are used daily by 
members of the Computer Science Laboratory as their 
only telephone service. Each Etherphone includes a 
connection to a standard direct-dial telephone line for 
access to telephones outside the Etherphone system. 
We chose to separate the voice-processing functionality 
from the workstation for reliability, performance, and 
flexibility: voice-processing peripherals allowed us to 

, provide reliable voice processing without compromising 
workstation performance on other tasks and to add voice 
to different types of workstations without changing the 
voice hardware. Etherphones transmit voice on a 
separate, 1.5 Mbps Ethernet because the only Ethernet 
chips available to usjjn 1981 could not operate at 
standard speeds. Tests and calculations convinced us 
that our existing Ethernets would provide adequate 
bandwidth and service to carry voice as well as data. 
Additional information on the Etherphone hardware and 

. the Voice . Transmission Protocol can be found in a 
previous report 17 . 

Voice Control Server; a program that provides control 
functions similar to a conventional PABX and manages 
the interactions among ail the other components. It runs 
on a dedicated serverthat also maintains databases for 
public telephone directories. Etherphone- workstation 
assignments, and other shared information. The Voice 
. Control Server is programmed in the Cedar 
programming' erivironment 18 . Centralized server 
software limited the necessary size and speed of the 
Etherphone processor, and thus its cost, as well as 
limiting the amount of C software. 

Voice File Server: a service that can hold conversations with 
Etherphones in order to record or play back user 
utterances. In addition to managing stored voice in a 
special-purpose file system, the Voice File Server 
provides operations for combining, rearranging, and 
replacing parts of existing voice recordings to create new 
voice objects. For security reasons, voice remains 
encrypted when stored on the file server. 

TexMo-speech Server: a service that receives text strings 
and returns the equivalent spoken text to the users 
Etherphone. Each text-to-speech server is constructed 
by connecting a commercial text-to-speech synthesizer to 
a dedicated Etherphone and is available on a first-come- 
first-served basis. Two speech synthesizers, purchased 
from different manufacturers, have been installed. 

Workstations: high-performance personal computers with 
: large bitmapped displays and mouse pointing devices. 
Workstations are the key to providing enhanced user 
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Figure 1 A Cedar scrc«n in use. The tWo windows in the upper left show a document preparation task, including voice annoiauon of an earlier 
version of this paper for communication among the authors. The bottom window of this pair shows new voice (represented by arrowheads) being 
inserted in the middle or an existing voice annotation. The two windows in the lower left show a programming task that is monitoring pan of the 
voice annoiauon system. The two windows in the upper right show images created .with several graphical illustration packages. The command 
interpreter window at the lower right accepts user commands, similar to a Unix shell. The bottom row of icons contains files and tools that are 
active but are not currently being manipulated by the user. 



interfaces and control over the voice capabilities. We 
; rely on the extensibility of the local programming 
environment- be it Cedar, Interlisp. Unix, or 
whatever- to facilitate the integration of voice into 
workstation-based applications. Workstation program 
libraries implement the client programmer interface to 
the voice system. 

In addition, the architecture allows for the inclusion of other 
specialized sources or sinks of voice, such as speech 
recognition equipment or music synthesizers. 

All of the communication required for control in the 
voice system is accomplished via a remote procedure call 
(RPC) protocol 3 . For instance; conversations are established 
between two or more parties (Etherphones. servers, and so 
on) by performing remote procedure calls to the Voice 
Control Server. During the course of a conversation, RPC 
calls emanaung from the Voice Control Server inform 
participants about various activities concerning the 



•conversation. Active parties in a conversation use the Voice 
Transmission Protocol for the actual exchange of voice. 
Multiple implementauoris-qf the RPC mechanisms permit the 
integration of workstauon programs and voice applications 
programmed in different environments. 



3. Examples of Applications to Date 



Most of our user-level applications to date have been 
-created in the Cedar environment although limited functions 
have been provided for Interlisp and for standalone 
Etherphones.. This section describes the voice applications 
that are currently available in Cedar, including telephone 
management, text-to-speech synthesis, and voice annotation 
and editing. Figure 2 shows a typical Cedar screen using 
voice, text, and graphics to support programming and 
document preparauon activities. 
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''in order to make voice a ftrst-class citizen of the Qedar 
environment Etherphone functions are typically avaUable in 
SS through an Etherphone contro. P"*L*™g 
commands that can be issued in a command murp^er 
Sow. and through procedures that a m be mvoke d from 
client programs. This integration of voice capabilities will De 
discussed more fully in the next section. 

3.1. Telephone management 

The telephone management functions provide improved 
caoabillies for placing and receiving calls. Figure 3 shows an 
eChone coStro! window, cai.ed Finch, and a personal 
telephone directory window. 

Users can place calls by specifying a name. ■ number • or 
other attributes' of the called party A ^ 
database for local Xerox employees (about 1000 entries) is 
JSS^n the Voice Control Server. Users can afeo creafc 
Sna° directories, which are consulted before the system 
KJry to locate the. desired party. A soundex search 
mechanism 5 helps compensate for spelling uncertainties. 

A variety of convenient workstation dialing methods are 
provided: a user can fill in fields in the Finch tool . « ** 
nTmes or numbers from anywhere on the screen, use either of 
wo Srectory tools that present browsable l.sts * 
associated telephone numbers as speed-d,a hng buttons, or 
redial any previously-made call by clicking on its 
convert tog entry/ Calls can-also be placed by name or 
number from the telephone keypad. 



Calls are announced audibly, visually, and functionally. 
Each Etherphone user selects a personalized nngtune. such 
L a few bTrs of 'Mary Had a Little Lamb". This ,« ■ 
played at a destination Etherphone to announce calls t that 
user 1he caller hears the same tune as a nngback 
confirmation. During ringing, the telephone icon jangles w«h 
fsuperimposed indication of the caller's name, as shown n 
*e Soaom portion of Figure 4. An active conversation is 
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Figure 4. F.thtrpho.c system icons. The two -cons at die ^pper left 
show a closed personal telephone directory and a Find . *» . * «J 
The icon at the upper right shows an outgoing cal to PoHe 
ZdUTers hone (username Pol.eZ.pa). The four bom «ni show 
« «S« of an incoming call from Doug Wyatt: the three left 
ETof L group are animated. during * fc ^2? 
conversation icon is used after the call has been 
Animation and visual feedback in .he «ons prov.de usefu. 
information without consuming valuable screen area. 
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The procedure outlined above t* . / Short annotations, but for more complex annotations the 
user will need facilities to edit portions of voice. To Keep things simple for rapid annotation, all 
that appeared in the text was an ic on representing a complete voice ana ranee. To perform 
editing "operation^, <g;ind 7i?yar ll& ie user, selects a region ol text and burtons Edit Voice. A 
voice viewer opens up for each voice icon within the selection. Each of the selected voice icons 
at this point changes its appearance to that of an opened voice icon — it how displays a number 
that identifies "the corresponding voice viewer. Each voice viewer is labelled with its number, so 
that the user can easily see the associations between voice viewers and positions in text viewers, 
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Figure 5. Voice annotation and editing. The upper window shows voice annotations being added to a Tioga document (the voice annotation 
system documentation). The third line of menu buttons ("AddVoice PlayVoice -."Xncar the top of the window are used to manipulate voice. In 
the second line of text, a voice annotation appears on the word "short"; indicated by die comic-strip dialog balloon' surrounding the character. 
In the fifth line, a similar annotation has been opened for editing in the lower window, labelled "Sound Viewer #1". In the sound-and-silence 
profile in the lower window, white rectangles depict silence, while dark rectangles depict sound. The profile contains several contextual 
indicators to orient the user during editing. The playback cue (the gray rectangle underneath the word "score") indicates the progress of voice 
playback. A temporary marker in the form of a small cross has been placed in the . voice section marked "Intro". The textual annotations with 
arrows are permanent markers that will be stored with the document. 



represented as a conversation between two people with a 
superimposed indication of the other party's name (also 
shown in Figure 4). The system automatically fills in the 
Finch tool's Calling Party or Called Party field to allow easy 
redialing of the last call. It also creates a new entry in a 
con venation log. A user can consult the conversation log to 
discover who called while he was out of the office. 

Our methods of locating users in an office building 
utilize the personalized ring tunes, which allow Etherphone 
users to identify calls to them wherever they may be: in their 
own offices, within earshot or at other Etherphones. If an 
Etherphone user logs in at a workstation, his calls can be 
automatical!/ forwardM to the adjacent Etherphone. An 
additional feature, c^'le'd visiting, allows him to register his 
presence with a second workstation or Etherphone. such as 
during a meeting.' Registering With the destination location 
allows users to travel more freely than forwarding calls from 
the home location does. Each visit request cancefrany earlier 
requests. The common problem of forgetting to cancel 
forwarding is eased by ringing both Etherphones during 
visiting. 

A more detailed series of examples that demonstrate how 
the Etherphone system's telephone management capabilities 
can increase office productivity appears in a related paper 16 . 

3.2, Text-to-speech synthesis 

A user or program can generate speech as easily as 
printing a message on the display by using one of the Text-to- 
speech Servers. A user can select text in a display window 



arid click the Finch tool's SpeakText menu button. A 

program can call a procedure with the desired text as a 

. parameter. These features are implemented by creating a 
"conversation" between the user's Etherphone and a Text-to- 
speech Server. The system sets up a connection to the Text- 
fo-speech Server, sends the text (via RPC). returns the 
digitized audio signal (via the Voice Transmission Protocol), 
and closes the connection when the text has been spoken. A 
similar mechanism is used for voice recording and playback. 

Our primary uses for text-to-speech so far have been in 
programming environment, and office automation 
applications. Programming environment tasks have included 
spoken progress indicators, prompts, and error messages. 
Office automation applications have included proofreading 
(especially comparing versions of a document when one 
version has no electronic form, such as proofing journal 
galleys) and audio reminder messages generated by calendar 
programs. 

3.3. Voice annotation and editing 

This section describes the addition of a voice annotation 
and editing mechanism to Tioga, the standard text-editing 
program in Cedar. More information about this system can 
be found in an earlier paper. 

Tioga is a high-quality galley editor, supporting the 
creation of text documents that contain a variety of type faces, 
type stvles. and paragraph formats as well as illustrations and 
scanned images. Tioga is the underlying editor for all textual 
applications in Cedar, including the electronic mail system. 
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the system command interpreter, and other tools that require 
the user to enter and manipulate text. Wherever Tioga is 
used, all of its formatting and multi-media facilities are 
available. Thus, by adding voice annotation to Tioga, we 
have made it available to a variety of Cedar applications. 
Figure 5 shows a closeup r of a Tioga document containing 
voice annotations, one of which is being edited by the user. 

Any text character within a tibga document can be 
annotated with an audio recording of arbitrary length. The 
userjnterface of the voice annotation system is designed to be 
lightweight and easy to use. since spontaneity in adding vocal 
annotations is essential. Voice within a document is shown as 
a distinctive shape superimposed around a character, so that 
the document's visual layout is unaffected. Furthermore, 
adding voice to a document does not alter its contents as 
observed' by other programs (such as compilers). 

Toadd an annotation, the user simply selects the desired 
character within a text window and buttons AddVoice in that 
window s menu. Recording begins immediately, using either 
a hands-free microphone or the telephone handset and 
continues until the user buttons STOP; A voice annotation 
becomes part of the document althbdgh the voice data 
physically resides on the Voice File* Server. Copies of the 
document may be stored on shared file servers or sent directly 
to other users as electronic mail. To listen to voice, a user , 
selects a regidn containing one or mbre voice icons and 
buttons PlayVoice. 

Simple voice editing is available on a visual 
representation of the >pice, Users can select a voice 
annotation and open a,wmdow showing its basic sound-and- 
silence profile. Sounds, from |iie same or other voice windows 
can be cut and pasted together using the same editing 
commands supported by^the Tioga editor. Editing is done 
largely at the phrase level, representing the granularity at 
which we believe editing can *e done with best resujts and 
least effort for an office situation". - A lightweight 'dictation ( 
facility* that uses a record/stop/backup model can be used to 
record and incorporate new sounds conveniently. ;The 
dictation facility can also hie used when placing voice 
annotations directly into documents. 

Sound^and-silence profiles alone do not supply adequate 
contextual information for users to identify desired editing 
locations, so several contextual aids are provided. A playback 
cue moves along the voice profile during playback, indicating 
exactly the position of the voice being heard. While playback 
is in progress, a user can perform edits immediately or mark 
locations'for future edits. Simple temporary markers can be 
used to keep track of important boundaries during editing 
operations, while permanent textual markers can be used to 
mark significant locations within , extended transcriptions. 
Finally, the system provides a visual indication of the voice- 
editing history in an editing, window. Newly-added voice 



* Editing on a word or phoneme level causes two difficulties: ihe system cannoi 
easify identify word or 1 phoneme boundaries for the user, and inserting or 
deleting words or phonemes is likely to cause results that sound choppy. 



appears in a bright yellow color, while less-recently-added 
phrases become gradually darker as new editing operations 
occur. This use of color is similar to Lippman et o/S use of 
color in text editing 9 . 

The voice annotation facility is implemented on top of a 
flexible set of voice editing and management primitives . 
Instead of rearranging the contents of edited voice files, the 
Voice File Server builds a data structure to represent the 
edited voice end stores it in a server database. This data 
structure, called a voice rope, insists of a list of intervals 
within voice files. Voice is included in documents, electronic 
mail, and client programs solely by reference to voice ropes. 
To promote sharing voice throughout our distributed 
personal computing environment voice ropes are immutable: 
recording and editing produce new voice ropes rather than 
modify existing ones. A modified style of reference counts 
enables unwanted voice ropes to be garbage collected. 



: 4. Progress toward a Voice System Architecture 

The original goals of.the Etherphone project were to 
produce experimental prototypes that could "tame the 
telephone" or "tame the tape recorder" in novel and useful 
ways. As the > project developed, however, a more 
-fundamental goal emerged: to create and experimentally 
validate a comprehensive architecture for voice applications. 
The best way to explain the value of a voice architecture is to 
4ist some of the properties it should have: 

• Completeness. It must be able to specify the role of 
telephone .transmission and switching, workstations, 
voice file servers, and other network services in 
supporting ail the kinds of capabilities we have 
identified, such as telephone services and recorded 
voice services. 

, • Programmcbiliiy. It must permit workstation 
programmers to modify existing voice-related 
applications and to create new ones. Simple 
applications should be easy to write, requiring little or 
no detailed understanding of how the system is 
. implemented. More elaborate applications might 
require a more thorough knowledge of the underlying 
facilities. The architecture must be designed to 
minimize the effect of faulty programming on the 
reliability and performance of the overall system. 
(Users of experimental software might experience 
program failure of reduced performance, but other 
users should not.) 

• Openness, it should define the role of each major 
component, so that different kinds of components 
cduld be used to provide the same functions. In this 
way. multiple vendors of telephone and office 
equipment could cooperate to provide advanced voice 
functions in conjunction with workstation-based 
applications. For example, a conventional PABX 
(business telephone system) could be used in place of 
the Etherphones to provide voice switching. Ideally. 
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such an architecture would evolve into a standard for 
voice component interconnection. 

The development orthe Etherphone system has included . 
an ongoing effort to define such an architecture, and to 
implement the system in compliance with it. Following the 
. general methodology employed by such modem 
communications architectures as the ISO reference model 7 or 
the Xerox Network Systems protocols^, the voice 
architecture is expressed as a set of layers, each calling on the 
capabilities of the. layer below it through well-defined 
interfaces or protocols. We have identified five distinct 
layers. From , highest to lowest these are the Applications 
layer, the ..Service, layer, the Conversation layer, the 
Transmission layer, and the Physical layer. 

The best way we have found to explain this organization 
is from the insidt out. beginning with the heart of trie 
architecture, the Conversation layer. It provides a uniform 
approach to the establishment and management of voice 
connections, or conversations among the various services. It 
also provides a standard method for distributing conversation 
state transitions and other progress reports to the various 
participants in each conversation. . All communications 
between services are mediated by Conversation layer 
facilities. In the Eiherphone system, the functions of the 
Conversation layer are implemented entirely within the Voice 
Control Server. However, the architecture does not mandate 
centralized control. Fc; example, Etherphones built with 
tei*ger memories and more powerful processors could support 
a distributed implementation, each managing the 
conversations that it or its associated workstation initiated. 

The Service layer defines the various voice-related 
services -such as telephone functions, voice recording and 
storage, voice playback, speech syndesis, and speech 
recognition -that form the basis for the voice applications. . 
Each of the sen/ices must follow the uniform Conversation 
layer protocols in creating voice connections with other 
services. However, each can register with the Conversation 
layer additional service-specfic interfaces (protocol 
specifications). Connections may be formed between similar 
services (as in a call from one telephone to another), or 
among different services (such as a connection from a 
telephone to the recording service, mediated by a workstation 
program). It is no: expected that ordinary programmers will 
produce new services; the services provide both the basic user 
facilities and interfaces to the building blocks for higher-level 
applications. In the Etherphone system, some services are 
implemented on the server machine that contains the Voice 
Control Server, others on separate server machines, still 
others on individual workstations. 

The Applications layer represents client applications thai 
use the voice capabilities of the architecture. To establish 
. voice connections, a client uses simplified facilities provided 
by a ser/ice that resides on the workstation along with the 
application. Gient applications also negotiate with the 
Conversation layer to gain access to specialized interfaces 
provided by other services. The previous section illustrated 
many of the present voice applications using the Etherphone 



environment 

Logically below the Conversation layer is the 
. Transmission layer. This layer represents the actual methods 
for representing digital voice, for transmitting and switching 
voice, and for communicating control information among the 
components of the system. In the Etherphone system, voice is 
transmitted digitally, in discrete packets, using a standard 64 
kilobits/second voice representation and our own voice 
transmission protocol. Other transmission and switching 
methods could be substituted without affecting the nature of 
the. programs operating in layers above the Conversation 
layer. Possibilities include synchronous digital transmission, 
or even analog transmission/ The only requirement is thai 
these components provide interfaces that allow the 
implementation of Conversation layer protocols. As we have 
mentioned, the control protocol selected for all control 
communications in the system was a locally-produced remote 
procedure call protocol.. Other remote procedure protocols or 
message-based protocols would work equally well. 

Finally, the Physical layer represents the actual, choice of 
communications media, for the transmission of both voice 
information and control (not necessarily the same media). 
Besides the research Ethernet that we use (operating at 1.5 
megabits/second), voice transmission on standard Ethernets. 
; synchronous or token-oriented ring networks, digital PABX 
^.switches, or analog telephone switches could be used/ 

Looking at the architectural layers, it becomes easy to see 
how our efforts differ from work being done elsewhere. Most 
of the current efforts to "integrate voice and data", such as 
those systems built around the Integrated Systems Digital 
Network (ISDN) definitions 4 , deal only with the 
Transmission and Physical layers. Other systems that include 
voice, such as the Diamond research effort at BBN 21 and 
commercial voice mail services, support some specialized 
applications exhibiting very scanty Service and Conversation 
layers. They mostly build directly on capabilities 
corresponding to our Transmission layer. By contrast we 
have concentrated our efforts on Conversation and Service 
layer specifications, and on the architecture in general. A 
project recently initiated at Bell Communications Research 
has been exploring similar goals and methods 6 . 

To date, only Oijc instance each of the Physical. 
Transmission, and Conversation layers has been 
implemented. We have used the resulting facilities 
extensively to produce the various Services and Applications 
described in the preceding sections. We have produced a 
relatively complete workstation service for Cedar 
workstations, and a preliminary implementation for Interlisp. 

We are not yet fully satisfied with the architecture, 
particularly the interface between the Conversation and 
Service layers. This interface has proven to be somewhat 
clumsy to use. while at the same time restricting the number 
of capabilities that can be readily produced. Recent progress 
is encouraging, however. 



166 



BMSdOClD: <XP_61754t AJ_L> 




5/ Related Work 

Incorporating "voice and/or telephony into a workstation 
environment isan active research area. 

Others have explored the use of local area networks to 
transport packetized voice. For example, the packet XCS 
system developed by Sincoskie and his colleagues at AT&T 
Bell Laboratories and later at Belt Communications Research 
also iises Ethernet 5 . In the Island system, the slotted 
Cambridge ring supplies guaranteed transmission bandwidth: 
voice and data are transported ori the same network 1 . 

. Work at IBM by. Ruiz addressed both voice and 
telephony 12 . Off-the-shelf voice and telephone hardware was 
added to an, office workstation. The system's prototype 
applications included directory-based autodiallng. call 
, logging, voice mail, telephone message recording and 
r retrieval, and voice, annotation of documents (one annotation 
per ImeX : Annotated documents were stored in three parts: 
the text, a file of pointers to voice annotations, anda separate 
file for each annotation. However, there were no jvoice 
editing capabilities except for the ability to append new voice,, 
and there was no visual representation of the recorded voice. 

The Sydis Information Manager is a commerciaJ system 
that provides voice and telephony 11 : It includes a shared 
central processor cluster with voice file storage, a voice 
processor, Jitxd' i' connection to the local PBX. Special 
workstations with integrated telephones called VoiceStations 
support directory-based autodialihg. voice and text mail, 
dictation and voice editing, voice annotation of documents 
(again, one annotation per line), and voice as a user interface 
output medium for items such as help messages or event 
l " announcements. The Sydis voice editing system uses a 
sound-and-silence representation similar to ours, but it lacks 
the ability to annotate the voiqe with text markers. 

Maxemchuk's speech storage system 10 , developed at . : 
AT&T Bell Laboratories in Jhe late 1970's. provided many of 
the same facilities. for recording, editing, and playing voice as 
our Voice File Server." Several additional features were 
included to permit scanning long voice messages: text 
markers that could be searched by a text editor, the ability to 
increase playback speed, and the ability to skip forward or 
backward in the message. Maxemchuk's system edited voice 
via divide and join operations that modified the control 
sectors of the stored voice message! By contrast, our scheme . 
of building data structures that reference segments of voice , 
files allows many users and/or voice annotations to share a 
single copy of the voice bits. 

The Diamond multimedia message system 21 , like the 
Etherphone system, manages documents that contain various 
media elements by reference. Voice annotations appear in a 
document as an .icon with a text caption: thus their placement . 
is even more, restricted than the one-annotation-peHine 
: systems above, but the captions are available for text searches. 
Because .-only Diamond documents can reference voice 
passages, "they can use a simpler reference counting scheme 
than ours. 

The MICE project at Betl Communications Research has 




constructed a centralized voice architecture to support rapid 
prototyping of communications services 6 , including remote 
call-forwarding, voice paging, voice mail, and combined 
voice/data messages with editing. The central control process 
of the MICE system interprets finite slate logic tables that 
specify the voic* services. These tables allow changes to 
algorithms and services without recompiling or reloading the 
system<which would interrupt service to users). 

In contrast to the MICE and Etherphone centralized 
server architectures, the approach used by Schmandt and his 

: colleagues at the MIT Media Laboratory associates a PC- 
based voice peripheral with a workstation 15 . The voice 
peripheral uses commercial speech and telephony cards to 
support voice filing and editing., telephony functions, and 

* speech recognition and synthesis. The system has been used 
to provide several sophisticated applications, including 
combined voice editing and recognition 13 , conversational 
telephone answering 14 , and office activity management 

6. Current and Future Directions 

Recall that .the Etherphone project began with a set of 
applications-level functional goals for taming the telephone 
and the tape recorder. After building a basic hardware 
platform for achieving these goals: early applications-level 
efforts convinced us of the need for a voice system 
architecture. More recent efforts have focused on developing 
the architecture and a few 1 interesting applications to 
demonstrate and explore its unique characteristics and 
flexibility. Voice project members have built most of the 
applications, although a few programmers have included 
telephone management or voice synthesis activities in their 
applications using interfaces provided by the Services layer. 

.We have begun to explore a nurnber of new directions 
and enhancements to the current capabilities. We have a 
skeletal implementation of call filtering that provides options 
based on the subject, urgency, or caller's identity to decrease 
the intrusiveness of the telephone for the callee. Our plan to 
integrate telephone conversation logs into the electronic mail 
system should have a side benefit of making the additional 
filtering information natural for the caller to supply. 

We are working on novel kinds of interactive voice 
connections, such as all-day "background** telephone calls, 
use of the telephone system to broadcast internal talks or 
meetings (as a sort of giant conference telephone call), and 
conference calls that allow side-conversations to take place. 
We have recently made it possible to register conversations by 
name (such as BudgetMeeting or RadioService); users may 
join these conversations as either listeners or participants. 

We plan to use our hardware-supported conferencing 
capabilities to incorporate text-to-speech or recorded voice 
into telephone calls. Among possible uses for text-to-speech 
are reading electronic mail over the telephone to a remote lab 
member as in PhoneSlave 14 or MICE 6 (but without 
dedicating a synthesizer solely to this task) and playing 
program-generated messages to callers, such as prompts or 
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reports of the user s location (possibly by consulting the users 
calendar program, such as "Dr. Smith is at the Distributed 
Systems seminar now. please call back after 5 o'clock"). 

We are also exploring a novel scripting mechanism for 
creating viewing paths through one or more electronic 
documents 23 . Built on the capabilities of the voice 
architecture, scripted multi-media documents can contain a 
combination of text pictures, audid. and action. Scripts can 
be us:d in a wide variety of ways, such as for formal 
dernonsuations and audio-visual presentations, for informal 
interpersonal communications, and for organizing collections 
of information. Scripted documents are a dynamic form of 
hypermedia document whose additional structure can be 
layered on top of existing Tioga documents. j. - : 

Finally, w; would like to extend the system to other 
media, such as still and real-time video, other workstations, 
and other architectures. 

In concert with wene on providing new features, we 
intend to continue to investigate the underlying systems and 
theories. Managing real-time and stored voice in a 
distributed environment presents many interesting problems 
in the areas of distributed systems 19 , user interface design, 
and voice transmission and processing technologies. 
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Table 1. Conditions for Transmitting and Display on Communication Devfce 



Ing number". . 



Number is 



Private telopltona 
Public telephone 



Number is blocked 

{both private and public telephones) 

Calling from network Incapable of 
transmission 

Network conditions prevent 
transmission 
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number could not be advised due 
to the conditions such as when a 
mobile phonies roaming (Table 1 ), 
Also, as an option, a function 
called Number Request is pro- 
vided which will respond to an 
incoming call in which the call- 
ing number is not advised by 
sending a message requesting the 
caller to call again and advise the 
calling number (Figure 2). 

When subscribers dials 136*, 
the Number Announce service 
sends a voice message advising 
the calling number of the latest 
call that was received (Fifiura 3). 
This service will be used by usual 
telephones. 

Furthermore, in order tc 
prompt the proper use of calling 
number information, NTT hes 
added a rider to tLe agreement 
when, the service is used, that 
"Guidelines related to the protec- 
tion of private caller information 
in using caller information advice 
service** 2 wiD be respected. 

Information of Colling 
Number 

Passing and Blocking 

A change to the articles in the 
Telephone Service Contract was 
approved in July 1996, under 
which passing the calling num- 
ber to the party receiving the call 



was included in the basic nature 
of telephone service. However, 
since there are occasions when 
privacy in a telephone call must 
be maintained, on request from 
the customer, the caller is now 

(2)Caflissentwitfioutadvislr>9 
calling r.^nber . 

Calling 
exchange - 




(3JReason for ^ 
rwt edVlsing* 



•2 Guidelines: Guidelines Issued in No- 
vember 1 990 by the Ministry of Posts 
and Telecommunications for the pur- 
pose of obtaining proper use of private 
information about caliers. including tele- 
phone numbers. 
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"This is 0451234567 (example). 

Please call again and 

advise your telephone number." 



C ReceiyeT) 
(Service user) 



Figure 2. Summary of foiniber Request Service 
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(Service user) 



"A call was received from 0451234567 (example) * 



Figure 3. Summary of Number Announce Service 
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■his paper discusses the adoption, interconnection and principal applications (CTI-relat- 
ed) of services such as Number Display. 
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Introduction 

NTT began to provide a full 
scale "Number Display" (a ser- 
vice that displays the calling 
number) in the three areas* 1 of 
Yokohama, Nagoya, Fukuoka 
from October 1997. The service 
began to be offered in other areas 
from February 1998. 

Number Display enables the 
subscribers to know the calling 
telephone number of an incom- 
ing call (hereafter referred to as 
the calling number) before that 
call is answered. The main appli- 
cations for this service will be (1) 
as a counter to nuisance calls and 
to callers who will not leave a 
message on the answering ma- 
chine, (2) for customer service 
work at call centers and (3) call 
management. A wide range of 
uses is envisaged in both family 
life and industrial activities. 
With a background of recent 
rapid popularization of the 
Internet and electronic mail and 
reduced prices of highly func- 
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tional personal computers and 
peripheral devices, in fields such 
as call center related systems in 
particular, computer telephony 
integration (CTI), a move to cre- 
ate a new communication envi- 
ronment through the integration 
of telephone and computer, is 
appearing. 

Summary of Services 

Number Display is a service 
in which the caller's telephone 
number is displayed on a display 
on the receiver's telephone before 



the call is answered (Figure 1). 
This service requires a commu- 
nication device with a specific 
function for receiving the calling 
number. At the time of the trial 
presentation, the data being ad- 
vised from the network to the 
communication device included 
w Calling number", u CT t "P" and 
"O", with the addition of a S" indi- 
cating a case in which the calling 



♦1 Three areas: Calling number display 
service had been tried In these areas 
from end January to end June 1997. 



(1) Advise and transmit 
calling number 

Catling exchange 



Receiving exchange 
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(Service user) 

Figure 1. Summary off Number Display Service 
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up customer data was for a re- 
ceptionist to ask the caller or to 
have tl*e customer enter the in- 
formation with a IVR* 3 which is 
seems meteo efficient However, 
with this service, data such as a 
telephone number passed from 
the network is read directly into 
a computer and is used as the key ~ 
for looking up customer informa- 
tion < which can then be displayed 
on the computer screen of the 
person receiving the call before 
the call is answered. This is called 
intelligent routing. This makes 
the work of reception very much 
more efficient. 

Figure 4 shows an outline of a 
configuration of CTL Interfaces 
between each device are in the 
process of being standardized 
(Table 3) and products with built- 
in sound boards and fax boards 
for unified message systems** 
and CTI servers with PBX func- 
tions for small-to-medium call 
centers are being offered. 

; : With the popularization of . 
the Internet in the United States 
and elsewhere, multimedia call : 
centers (Figure 5) are beginning 
to appear that have the informa- 
tion providing capability of tf 
WWW server and handle custom- 
ers with Internet telephony while 
they watch a WWW page. Also, 
new services are appearing that 
integrate the IP network with the 
PSTN (Public Switched Tele- 
phone Network), such as that 
shown in Figure 6, where a user 
browsing the WWW via a tlial-up 
connection can use Caller-ID* 6 to 



*3 IVR: An automatic telephone t answer- 
ing device wtei a voice response unit for 
handling caner requests. 

♦4 Unified message system: A message 
system that integrates e man, voice mail 
and faxes. Equipped with functions to 
announce the arrival of e-mail and voice 
maif to portable telephones and pagers 
and to convert text to voice . 

*5 Caller-ID: Name of a service {carting 
number display service) being offered 
in USA from 1997. 



take an interrupt call from a tele- 
phone connected to PSTN. 

Plan to Upgrade Service 

With regard to calling num- 
ber display, the results of a sur- 



vey in the trial areas in connec- 
tion with future services have 
revealed a strong need for caller 
name-display. A review of condi- 
tions in the United States shows 
that Caller-ID had spread to 
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able to control of revealing the 
number by using a number op- 
eration. Two types of blocking 
were made available by the sub- 
scriber telephone service. One of 
these is Per Call Blocking which 
involves dialing "184" before the 
called number to prevent the 
calling number being displayed 
to the party being called. The 
other is Per Line Blocking in 
which the display of the calling 
number to the party being called 
is blocked for all calls from a pre- 
viously specified line, even when 
the conventional dialing proce- 
dure is followed. Per Line Block- 
ing is provided with the Single 
Call Display function that can- 
cels the blocked condition when 
w 186 ,, is dialed before the called 
■> number. Meanwhile, functions in 
the "INS Net* (Integrated Ser- 
vices Digital Network (ISDN) 
services) that display and block 
the calling number have been 
brought in line with those of the 
telephone service. 
Changing the Calling Number 

The calling number that is 
displayed is based on the tele- 
phone numbelbf the line that is 
being used by. the caller but, for 
convenience, the subscriber 
(caller) can apply for another 
number to be passed under cer- 
tain circumstances. 

If a pilot number is being 
used, the pilot number from a line 
within the same hunting group 
may be passed for display. If Di- 
rect Dial-In is being used, a pre- 
viously specified optional num- 
ber from the Direct Dial-In group 
may be passed. Moreover, when 
Free-Dial "0120" Service is being 
used, the. Free-Dial contract line 
and the Free-Dial number from 
each line within the same hunt- 
ing group may be passed. Now, 
the passing of Direct Dial-In and 
Free-Dial numbers is an extra 
function for which a charge is 
mace. 
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With INS Net, the pilot num- 
f ber and the Direct Dial-In num- 
ber can be passed by the calling 
number display function (the 
basic function)! • 

Interconnection 

On the premise that the 
caller's request with respect to 
revealing or blocking the num- 
ber will be delivered reliably to 
the party being called, there is a 
mutual interchange of number 
data and other information be- 
tween NTT and other carriers. 
When the full scale service is 
provided, calling numbers will be 
able to be passed constantly be- 
tween calls within the NTT net- 
work (including calls through 
long distance carriers), suid.be- 
tween most mobile and PHS car- 
riers. 

The Applications 

Due the attributes of calling 
numbers, such as unique and 
widespread throughout the na- 



tion, we can think of various uses 
for the services in the home and 
in business. 

Main uses in the home could 
include the avoidance of nuisance 
calls such as non-speaking calls 
and persistent telephone sales 
calls. If the telephone number is 
known beforehand, the call can 
be handled by an answering de- 
vice or the Call Refusal function 
(stop bell ringing).. We expect that 
encouraging a telephone-depen- 
dent society that assumes the 
callmgnumhier wi^tbe transmit- 
ted, will serve to deter wilful 
nuisance cMls. Table 2 shows the 
other main functions that will be 
able to be used by dedicated com- 
munications devices by means of 
this service. , 

Business uses would include 
CTI in call centers. CTI had been 
advocated previously but in con- 
ventional systems, the only way 
to obtain information such as 
names and telephone numbers 
which are necessary for looking 



Table 2. Examples of Functions Enabled by Communications Devices 



: \ Haxri i [ 
Display calling number 

Save calling number 
Transmit to saved number 

Display caller's name 
Distinctive ringing 



Display reason for 
number Mocking 



Call forwarding 



Answering method 
selection (Call refusal) 



Function 



Caller's telephone number is displayed 

Cafler*s ?&i6pnone number is saved to memory 
and displayed 

Initiates a call to the caller's telephone number that 
has . been saved to memory 

if the caller's telephone number is the same as a 
previously registered telephone number, the caller's 
name, et&ls displayed 

Specifies a distinctive ringing sound for a call 
received from e previously registered telephone 
number and specifies a number of ringing sounds 
for registered telephone numbers 

Displays the reason for the calling number not being 
displayed 

Designates of receiving equipment (cordless stave 
unit etc.) in accordance with calling number 

A caU whose calling number is not displayed, or whose 
calling number has been registered lor refusal, is 
answered automatically with a recorded message 
then feated as a nuisance call by hanging up, eta 
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Table 4. Status of Calling Number Display Services Provided In Various Overseas Countries 

, Name of country • 1990^. 1991^ 1992; 1993 , 1994. 1995 ** 1996, 

North United States 

America Canada ^ 

Cantml^and Mexico t \ V *- * * i: 5 88 

Souih Ameffca Columbia . fc^tfe - : ' - 

.;. r ' .!."... United Kingdom 
Sweden 




Europe 



Netherlands 

Belgium 

Spain 

France * 

Germany 

Denmark 

Norway 



v ^-S^^f- Turkey 



MkkSeand 



terae) 



Near East 

■V >I'«*i^^ : Saudi Arabia 

'■ ••'v^? : • Australift 

East Asia ' ] . New Zealand 

and Oceania^ Singapore 

" • , *. ■ J- i- Hong Kong 



numerical data) and the privacy 
issue. 

Meanwhile, although the 
Number-Announce service w&s 
considered to be a service from 
the early period of information 
notification services, since it is a 
service that does not require spe- 
cific adapters and communica- 
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tion devices and it is in high de- 
mand, we will continue to study it 

Conclusion 

In the future! we will continue 
to improve customer convenience 
by upgrading services which 
meet the heeds derived from new 
movements such as CTI. 
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about 20 % of the whole of the US 
at the end of 1996 and in NYNEX, 
about 95 % of Caller-ID seem to 
be shifting to Caller-Name* 0 . If 
Caller-Name is limited to within 
each Regional Bell Operating 
Company (RBOC), it can be used 
in almost the whole of the United 
States and notification of related 



information between RBOCs is 
gradually being extended as con- 
tracts for the interchange of indi- 
vidual names are concluded. Table 
4 shows the status of services to 
display calling numbers in vari- 
ous overseas countries. 

Based on this background, we 
would like to promote continued 



studies of the passing of name 
data and public telephone data 
by considering the impact on net- 
works and communication de- 
vices (of handling other than 

•6 Caller-Name: Offered as an extension 
of Caller-ID (for customers including tn 
Caller-ID) which advises name of sub- 
scriber. 
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Call Centres in BT UK 
Customer Service e 



More and more business 
is being done via call 
centres, both within the 
UK and worldwide. BT 
has been involved in call 
centres for many years. 
This article reviews the 
current situation in 
oi ia Customer Service. 

and considers the next 
steps in this increasingly 
comoetitive and 
cr(dJ.engin.ii environment. 



Introduction 

Call centres are a growing industry, 
in a recent Ovum Report it was 
estimated that by 2000 5% of the 
working population in Europe would 
be employed in call centres. It is also 
predicted that call-centre revenues 
globally will be $6 billion. Call 
centres have become the core of the 
service economy in the UK Counting 
part-timers, UK call centres cur- 
rently employ 1*7% of the working 
population, or nearly 400 000 people. 
And the numbers are growing. 
Datamonitor predicts that call-centre 
positions will double by 2002 before 
beginning to level oft Since 1994, 
customer calls to large organisations 
have roughly doubled, with call 
centres being largely responsible. 

BT UK is a leader in call-centre 
technology and expertise. It operates 
its own call centres and manages 
them for other organisations. BTs 
own call-centre operations fall into the 
two broad categories of outbound and 
inbound. Outbound operations involve 
companies staff calling out to custom- 
ers, usually to offer new services, ' 
while inbound call centres handle 
enquiries from customers. This article 
looks at how the inbound call centres 
are presently used in the following 
environments, which between them 
are responsible for handling approxi- 
mately 1 billion calls per year 

• answering Operator Assistance 
enquiries f 100, 155 and 999 
emergency services), 

• giving accurate number informa- 
tion Q92 and 153 services), 

• receiving sales and billing enquir- 
ies and taking orders (150), and 
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• taking fault reports from residen- 
tial and business customers ■. 151 
and 154 i. 

It does not consider some of the 
more specialist centres < for example. 
Welsh language. Directory Enquiries 
for the blind and disabled) or 
inbound services and business after- 
sales units, although these all form 
an important part of the BT Cus- 
tomer Service call-centre manage- 
ment operation. 

The article explores where we are 
now" and in particular focuses on 
systems, switching and queuing 
capabilities. Finally, it looks at the 
future of call centres and gives some 
indication of the way things will 
change. 

Evolution 

Although the title cell centre would 
not have been used. BT has been 
taking calls from its customers ever 
since telephony was introduced. In 
the early days, these were large 
operator centres serving the geogra- 
phy around the exchange. Over time, 
separate centres for activities such as 
sales and fault reporting were 
introduced, but these again were 
geographically based, generally small 
and supported only by simple 
technology. 

The evolution from Telephone 
Areas to Districts and then to a more 
functional organisation significantly 
reduced the number of centres, but 
there are still over 100 units and by 
modern standards many of these 
would be regarded as small. However, 
the introduction of both switching 
and systems technology has enabled 
BT to maximise the efficiency of 
these units, although more can and 





Figure 1 -Operator Assistance 
information systems 



will be done (see paragraph on ca-1 v 
centre futures i - - 

Operator Assistance 

/Originally the operator was the only 
■connect through' service for all calls. 
With progressive network automation, 
this capability is now only used when 
a customer requires assistance, when 
a customer encounters difficulty ■ for ' 
example, permanently engaged j. or " ~ 
. when the customer wishes to use 
special facilities such as alternative 
payment options. BTs Operator 
Assistance covers assistance »: 100, • 
155 » and emergency (999* services 
and currently handles 500 000 balls ; 
per day at 15 call centres. 

Network and systems 

Figure 1 illustrates the information -•* 
systems involved in Operator 
Assistance. 

Common call-centre technology 
supports all types of call for Operator 
Assistance (100), emergency services 
1 112. 999) and International Operator 
Assistance * 155). System X 
BT operator services subsystem ^ 
(BTOSS) switches route calls to a free; 
operator together with associated 
network data -such as calling line 
identity <CLI) and class of service. The . 
operator assistance workstation 
t OAWS I, developed by BX provides 
integration of the telephony and 
information systems, to present 
operators with a highly integrated 
and user-friendly interface to the 
underlying systems. This allows it to: 

• obtain information from the 
network via BTOSS, 

• obtain information from the 
external database <EDB> (state of 
line, name and address etc. J. 

• obtain information from the 
international database i IDB) 
(time zones call tariffs, routing. 
Inmarsat), and 

• vet BT Chargecards using CSRS 
(Via EDB 'as a gateway i. 
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BT0SS: 8T OPERATOR SERVICES SUBSYSTEM 

dSA: CHARGING INFORMATION SUPPLY AND AUDIT 
' CSRS: CASHLESS SERVICES REPLACEMENT SYSTEM 

CSS CUSTOMER SERVICE SYSTEM 

CSU: CUSTOMER SERVICES UNIT 

OLE: DIGITAL LOCAL EXCHANGE 
; EAVfrS: EXTERNAL ASSISTANCE WORKSTATION 
QSEC: WHANCEO INFORMATION SYSTEM FOR 

' EMERGENCY CALLS 
- EOB: EXTERNAL 0ATA8ASE 



To maintain the accuracy of EDB, 
daily data feeds to EDB are received 
from both the customer service 
system (CSS) and other licensed 
operator receipt system 2 (LORS2). 

The design employs object-oriented 
software and a script engine to achieve 
a very efficient and flexible user 
interface to capabilities which embody 
the required operating procedures. The 
workstation also incorporates voice 
technology, which automatically plays 
a greeting announcement in the 
operator's own voice on call arrival. 

The handling of emergency (999) 
calls demands extremely high 
systems resilience and the architec- 
ture is designed to achieve 99' 9999** 
systems availability. Information 
required to handle 999 calls is held 
locally and is available from remote 
servers in the event of failure. 
Further fall-back facilities are also 
provided to protect against multiple 
failures. Similarly, multiple levels of 
network fall-back are used to ensure 
network capacity Is always available 
for these calls and to automatically 
route around network and call -centre 
failures. A recent enhancement has 
been the introduction of the en- 



IBTN: INTERNAL BACKBONE TRANSMISSION NETWORK 
ICN: INTERNAL CORPORATE NETWORK 
IDB: INTERNATIONAL DATABASE 

IP: INTERNET PROTOCOL 
IVR: INTERACTIVE VOICE RESPONSE 
L0RS2: OTHER LICENSED OPERATOR RECEIPT SYSTEM 2 
NOtSfc NATIONAL OPERATOR INFORMATION SYSTEMS CENTRE 
OA: OPERATOR ASSISTANCE 
OAC: OPERATOR ASSISTANCE CENTRE 
OAWS: OPERATOR ASSISTANCE WORKSTATION 



hanced information system for 
emergency calls i'EISEC). This 
provides the caller name and address 
information for a 999 call to the 
emergency authority through the BT 
firewall saving critical seconds in the 
response to an emergency. 

Limited access, via the BT intranet, 
is also provided to the service manage- 
ment system. This enables operators to 
check for and report faults when 
required and transfer callers to an 
appropriate service department. 

Directory Assistance (DA) 

BT has a broad range of number 
information products including the 
well known Phonebooks and more 
recent innovations such as 
PhoneNel -the Phonebooks on the 
Internet ' wu-w.ht.co m/'phonenetuk ». 
Directory Enquiries is a key part of 
the portfolio handling some 650 mil- 
lion calls a year at 47 DA call centres. 
Customer care, speed and accuracy 
are high priorities in handling 
Directory Enquiry' calls. BT has 
recently completed the ATLAS 
programme, deploying state-of-the- 
art technology into this service. 
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Network and systems 

Figure 2 illustrates the cull centre 
and information system.*; tor BT 
Director/ Enquiries. 

The DA service uses Lucent 
Technologies 5 ESS automatic call 
distribution (ACD* switches. Put 
simply, the purpose of the ACD is to 
place each incoming call into a 
single large queue so that the first 
free operator can answer it. irre- 
spective of where the DA centre is 
located. However, for the service to 
operate efficiently and effectively, 
calls need to be handled by opera- 
tors who have the best match of 
skills to the call. This is achieved as 
follows: 

- O Incoming calls are allocated to 
queues based oh the locality of the 
caller i derived from the CLI.i.- 

• Operators are grouped into 
serving teams 'skill groups* 
identified by operator identity or 
groups of workstations. 

• Through configuration data, 
serving teams are allocated to 
each queue as the first choice to 
handle calls from a queue: the 
primary team. The configuration 
tables 'Figure 3> also define the 
sequence of other serving teams to 
which a queued call is offered if a 
free operator in the primary team 
is not available within the desired 
answer time (the secondary team I. 
The configuration tables can be 
changed dynamically and set-up 
to change automatically; for 
example, when a call centre opens 

. or closes. 



Figure 3 -Queue configuration table 
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The ACDs are also integrated 
with management information and 
workforce management systems 
which are used to manage service 
quality and achieve an optimum 
match of operator resources to 
incoming call volumes. 

Operator workstations 

Directory Enquiry operators use the 
Directory Assistance workstation 
(DAWS). As with the Operator 
Assistance workstation (OAWS), this 
provides integration of the ACD and 
information systems, and employs an 
intelligent application. This presents 
operators with a highly integrated 
and user-friendly interface to the 
underlying systems and which 
supports the business processes. 
When a call arrives at the 
workstation, an automatic greeting 
in the operator's voice plays. The 
main function of the workstation is to 
provide operators with access to 
search the national directory infor- 
mation database for numbers and 
code information. When the correct 
number has been located on the 
database, the operator then transfers 



the cull to the speech applications 
server «SAS>. which automatically 
announces the number with the 
familiar phrase: 'the number you 
require is 

Pathfinder database 

The heart of the system platform is 
the national Directory Enquiry 
database (XDIS), supplied by 
Hewlett Packard and PCPlus. 
Dimensioned to hold over 150 million 
listings, it will perform a search and 
return information back to the DAWS 
screen in less than 0 5 seconds. The 
database receives 40 000 updates per 
night. 

Customer Service and 
Sales, Account Services, 
Sales and Billing and 
Residential Repair 

With the evolution of call centres 
described earlier, similar technology 
has also been deployed into customer 
sen-ice centres fCSCs) supporting 
150 ( account services, sales and 
billing i and .151 (residential repair) 
calls. 150 accounts for approximately 

1G- 
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300 000 calls per day. with about 
70 000 per day offered to 151. 
However the arrival pattern on both 
channels can.be .very irregular .both 
during the.day (see Figure 4"« and 
through the week, with particularly 
?high call levels being experienced on 
Monday morning or in periods of bad 
weather. 

Weftwork and systems 
The 55 customer service centres 
throughout ihe UK -comprise, on 
average, 120 agent positions and are 

i served by two Nortel DMS-100 
switches (Figure 5t. The switches are 

: located remotely from the call 
ccatres. providing ACD functions and 
support load balancing between the 
ACDs. Call routing strategies are 
configurable and can be evenly 

"distributed or weighted to a parties 
lar ACD group or groups. Calls can ' 

' overflow after a given threshold has 
been exceeded— either on number of 
calls in queue or length of time in 
queue. 



As with the previous call centres an 
intelligent agent workstation is 
employed -SMART. SMART is a 
three- tier distributed system, which 
provides a graphical user interface' to 
the billing and order entry capabili- 
ties of CSS and to the data of the ' 
products and services database 
(PSD), it supports a set of 
configurable business rules, which 
reduces errors and provides a 
simpler more efficient set of proce- 
dures compared with native use of 



Figure 5 -DMS 100 configuration 
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Figure 4— typical days call pattern into 151 repair 



CSS. It also provides sophisticated 
sales support tools which prompt 
advisors with on-line selling 'hints 
on the products or services best 
suited to the particular customer. 

< The workstation client is a PC, 
hosting SMART and other applica- 
tions orientated to the needs of the 
inbound channels. A mid-tier sup- 
ports local data, particularly business 
rules data, a communications 
gateway to the CSS systems and a 
set of 'operability* features designed 
to facilitate remote management of 
the SMART system across all three 
tiers. 

systems {ftflOSJ 
Good customer service depends on 
having the correct numbers of agents 
available to deal with the calls arriving 
at that particular time of the day. The 
resource and administration teams use 
management information displays to 
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monitor service quality throughout the 
day. Most widely used in Customer 
Service is the Nortel RT1000 which 
displays real-time information on 
number of calls, number of available 
agents, number of calls queuing, length 
of time in queue, etc. Additional 
advisors are brought on-line when 
needed; however, a sophisticated 
forecasting and scheduling package 
ensures mat accurate staffing is 
achieved based on projections derived 
from historical data gathered over the 
previous three years. RT1000 has a 
real-time adherence monitor, which 
alarms if the ACD queues are over- or 
under-staffed. 

RT1000 also provides historical 
reports and is able to change the call 
flows in real time via load manage- 
ment commands to the DMS ACDs. 

CaQH steering 

In order to deal efficiently with the 
variety of call types being offered, 
and to route them to advisors with 
the required specialist skills, auto- 
mated call steering is used. 
Syntellect interactive voice response 
'IVRj platforms are used in conjunc- 
tion with Gcnesys computer tele- 
phony integration < CTI i servers and 
the Nortel DMS ACDs to capture 
details of the calling customer and 
the sen-ice request and then to route 
the call to the most appropriate team 
of scn'ice advisors. CTI is also used 
to present the- customer account 
details to the advisors on call 
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arrival -so called SMAUT semen- 
pops. IVR-based call steering ! Fig- 
ure 6* permits a straightforward 
customer contact strategy while 
ensuring customers arc routed to the 
correct ACD queue via simple menu 
selection from their telephone 
keypad. 

Automation . 

Customer Service has also imple- 
mented a number of interactive 
customer 'self-service' applications, 
which enable customers to conduct 
simpler service transactions them- 
selves using IVR systems. Examples 
include: changing Friends and 
Family calling circle numbers. 
. simple billing enquiries, initial 
reporting and progress enquiries for 
faults. The fault reporting applica- 
tion performs a line test on the 
suspected faulty. line and either 
reports the. fault automatically or 
connects the caller to a service 
advisor. If the later option is chosen, 
the results of the line test are 
presented to the advisor's screen at 
the same time as the voice call using 
the CTI facilities as above. It is 
estimated that the automated work 
volume is equivalent to three call 
centres. 

These automation services are 
also available to customers via the 
Web t bt.com \ and. via the BT 
intranet, to advisors in other types of 
call centres: for example. Operator 
Assistance or Outbound Sales. 

Business Customer Service 

BT Business Customer Service has 
five main components comprising 
154 Fault Management, OSOO 666777 
Alter Sales, Customer Relations. 
Simplex Management and the BT 
Own Use Service Centre in Glouces- 
ter. The combined units have in total 
around 1200 sen-ice advisors distrib- 
uted across 27 centres throughout 
the country. The largest proportion of 
these people are within the 154 fault 
management centres (FMCs), which 
comprise around 600 people in 
12 different locations, followed by 
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After Sales with around 250 people 
in nine locations. 

The customer segments supported 
by these centres range from single- 
line business customers through 
small/medium enterprises up to 
major customers. The customer base 
is some 1-2 million customers with 
some 4 million accounts. The Fault 
Management and After Sales units 
are predominantly inbound with 
Customer Relations and Simplex 
Management involving outbound 
calling. 

Network and systems 

Compared with the residential area, 
the volume of calls handled are far 
lower but calls are much more 
complex. This is being fuelled by the 
growth in data products and services. 
As a result, the BT network how 
carries more data traffic than voice 
calls. 

Calls are presented to advisors by 
customers calling .154 or via 0800 or 
dedicated numbers, all of which are 
all delivered to a network of Nortel 
Meridian switches: the CSC network. 



mnvnary 



The CSC network then uses ACD 
functionality to deliver the oldest call 
in the network to the longest waiting 
agent. The CSC network also seeks to 
deliver calls from customers to their 
'home' fault management centre: that 
is, if a call is identified as being 
generated from East Anglia. the 
network attempts to deliver the call 
to Ilford FMC first. 

With the introduction of Marks- 
man to Business Customer Service, 
calls are still delivered by the 
Meridian CSC network, but then 
Genesys CTI takes control of the call 
and the routing strategy for it. If a 
customer has an outstanding order, 
fault or issue with BX CTI firstly 
tries to route the call to the advisor 
who originally dealt with the call. If 
that agent is not available, then it 
tries the ad visor's team, then the 
, remainder of the centre before 
looking for the next free advisor in 
the country' Planned enhancements 
include the introduction of intelligent 
call routing (ICR) enabling sophisti- 
cated call plans and the use of IVR to 
identify customers and the numbers 
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Figure 7— Marksman overview 



CALL ^TOUCHTONE 
CUSTOMER O- 



: INTER' CTtV'JVOICc 
" RESPONSE . 



they are reporting faulty. Figure 7 
shows how Marksmun delivers a call 
using CTI. IVR and the operational 
customer database *OCDB>. 

Operational irequirenrieinifcs 
With the diversity of customers and 
the product set that needs .supporting, 
speed of information retrieval is the 
key to resolving a problem at the first 
point of contact. The Marksman 
programme has put in place a PC and 
local area network ' LAX i infrastruc- 
ture within the service centres upon 
which a series of high-speed applica- - 
tions can be built. The applications 
make use of a variety of technologies, • - 
ranging from bespoke Visual Basic • 
(YB) front-end screens to wer>based, 
information pages. The ability'to'test 
customers' apparatus is also important 
particularly with respect to their lines. 
For complex problems such as those 
posed by ISDN and Business Highway, 
Marksman uses case-based reasoning 
i CBR:' to improve the probability of a 
correctly diagnosed fault resolution. 
Integrated with the CBR is an ISDN 
line test system t IFETS i which checks 
the customer's line end-to-end. PSTN 
lines can also be tested via the Marks- • 
man application: In future, line tests 
wilt be performed automatically using 
IVR collected data while the advisor is 
confirming the customer details. 
Further benefits this delivers are: 

o More problems resolved at the 
first point of contact without 
increasing call handling times. 

o The application is flexible ena- 
bling the introduction of support 
•for new and bundled product sets 

O The management information 
statistics 'MIS) allow manage- 
ment of service levels and advisor 
performance on a real-time basis. 

Figure S shows the components of 
the Marksmun solution. 

WJarCtsman (technical! overview 

The agent workstations are standard 
desktop PCs. Automated software 
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delivery to the PC ensures that the 
current versions of software are 
always maintained. In addition to the 
Marksman application, the agent 
workstations are provided with 
standard office automation packages 
and e-mail. The function of the back- 
end systems is detailed below: 

The customer handling intermedi- 
ate server (CHIS) and common 
handling of event service system 
t CHESS) are mid-tier servers 
providing an interface between the 
OCDB and G3S.Tha mid-tier 
services use event notification to 
ensure that fault, order and issue 
data on CSS is updated on OCDB. 

Genesys T server is made up of 
several different components, namely 
router, stat server, call concentrator, 
and database server. Together these 
components provide the heart of the 
CTI call delivery and drive the 
intranet-based statistical information 
on advisor performance. 

The five case-based reasoning 
*CBR) servers provide the interface 
between advisors and the main CBR 
case base server. The five interface 
machines provide an equal load 
sharing a;*d back-up capability for 
advisors logging onto case bases for 
resolving customer problems. 

. The wnli L-ccrd server drives the 
tri-colour wall boards in the centres 
which are used to inform advisors of 
the current status regarding custom- 



ers waiting or advisors waiting for a 
call. They also inform management of 
advisor status: that is. number of 
positions manned, number of advisors 
either on a call, not ready or waiting. 

The script repository is used to 
hold scripts which automate repeti- 
tive tasks that would otherwise be 
time-consuming and tedious. 

Call! ©enfre Futaires 

One thing that con be said with 
certainty is that the information 
revolution taking place in society 
today will have a dramatic impact 
upon call centres. 

BTs call centres have evolved in 
an age where the telephone is king, it 
forming the principal real time access 
method into the business. This 
philosophy is extended into both front- 
and back-oflice support systems, 
which today are organised around 
customers* telephone numbers. 

The deployment and integration of 
desktop computing and CTI into call 
centres has brought about significant 
improvement in efficiency and the 
customer experience. Simultaneous 
voice and customer data delivery at 
the time of call arrival and the ability 
to route calls depending upon 
customer service advisor skills levels, 
are examples in use today 

Tins incremental approach to 
improving call centre infrastructure 
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has kept BT in a competitive position, 
but is it sufficiently radical to leapfrog 
the competition as BT confronts the 
information revolution, and will it 
deliver the efficiencies and capabilities 
■ needed to run a world-class operation? 

Site rationalisation 

Piler extensive research, both in the 
UK and abroad, plans are being 
developed to consolidate the existing 
call centres. Modular centres, which 
are sized for the optimum number of 
workstations will be created to make 
most-effective use of accommodation 
and management skills. This will lead 
to fewer, but larger centres. These will 
be provided with a common systems 
hardware infrastructure, offering high 
resilience and flexibility. 

Customer touch points 

Today we are witnessing a massive 
growth in the communication chan- 
nels that arc being made available to 
customers; the Internet, mobile 
communication;) and Internet-enabled 
IV are all areas experiencing fast 
customer take up. Figure 9 illustrates 
the communication channels which 
will be available to customers. 



This spectrum of communication 
channels will grow, as we come to 
terms with the transformation taking 
place- in communications, moving from 
voice to a data centric environment. 

How will this change impact 
on BT's call centres? 

.Customers increasingly expect to be 
able to do business via the communi- 
cation channel of their choice, with 
the 'experience* being consistent and 
content rich. 

- i To satisfy this customer expecta- 
tion requires a new approach to the 
way call centre systems are designed 
and delivered. Historically, these have 



been developed largely in isolation to 
. satisfy the specific needs of each 
channel; for example, 150. 192. This 
has resulted in a very close fit with 
business requirement and world -chws 
performance but at the expense of 
wide flexibility A key constraint 
arising from this Vertical' approach is 
to prevent the total call centre 
resource being regarded as a single 
resource "pool' at times of peak 
customer demand and makes organi- 
sational change difficult. 

By adopting a 'horizontal' ap- 
proach to the systems architecture of 
call centres, both new channel 
integration and reuse of common 



Figure 9 -Communications channels to call centres 
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Figur* 10 ^Three-layer model . . 

system components can be achieved. 
Figure 10 illustrates the concept. 

. The approach has a number of 
benefits: 

o For the construction of a new 
sen-ice ch?J&iel, for example, 
Internet-enabled TV, the re-use of 
common business services in the 
layer 2 mid-tier enables both the 
channel and the existing call centre 
infrastructure to support this with 
minimal new development Hence 
lower costs and faster time to 
market result when compared with 
the traditional approach. 

o Dynamic integration at the desktop, 
of the applications and information 
required for an individual advisor's 
skill set increases the flexibility of 
the call centres and the advisor 
resource pool. 

o Provision of integrated workflow 
capabilities enables a customer 
service advisor to work on a 
dynamic blend of customer service 
requests including voice calls, 
e-mail. fax. written correspond- 
ence, etc. 

This infrastructure has one further 
major benefit: it forms the backbone 
by which seamless customer-centric 
service can be delivered, utilising a 
complete view of easterner relationships 
to ensure service is tailored to their 



individual needs and preferences. The 
call centres will increasingly take on the 
roles of help desks, providing immediate 
context sensitive support to customers, 
with the .majority of service transactions 
handled in aVzero-touch* manner. In 
other words, service will be delivered to 
the customer without any person being 
involved except perhaps a customer 
engineer. In this environment, if the call 
centre is, contacted, it will be increas- 
ing in a secondary 'assistance* role 
rather than a primary transaction' role. 

Transactors £© relationships 
In a business ^environment, where BTs 
core network is no longer a competitive 
advantage and where it is possible to 
build a new telco from scratch in under 
five months, how does BT compete? 

Jack Welch, CEO of General 
Electric, made the following state- 
ment, 'We have only two sources of 
competitive advantage: 

o The ability to learn more about 
our customers than the competition, 

o The ability to turn that learning 
into action faster than the 
competition/ 

BT is confronted with exactly the 
same challenge, a customer that has 
brought a product via bt.com will 
expect BT to know about it when they 
ring in with a service call that same 
day, and they will not expect to receive 



a call asking if they would like u> 
purchase the very same product! 

As the number of customer commu- 
nication channels increases and BT 
moves towards the bundling of 
products Via propositions, the company 
needs an integrated information 
service that supports a complete view 
of the customer relationship, delivered 
via a powerful interface to the cus- 
tomer service advisors. This would 
enable the complete customer relation- 
ship to be visible no matter how the 
customer makes contact. 

Delivering systems and a ppb* cations 
that support this level of functionality 
will be a significant challenge. How- 
ever, the increasing functionality and 
maturity of proprietary CRM and ERP 
packages, means that it is feasihle to 
configure standard software packages 
to achieve this rather than embark on 
bespoke development. Availability of 
'out-of-the-box solutions will be 
essential to support the evident rate of 
change in the marketplace. 

Several initiatives are underway- 
looking at how BT can satisfy this 
requirement, with vendors of both 
front- and hack-office applications 
being assessed. 

In summary. BTs Call Centres will 
change dramatically over the next few 
years as customer service. BTs true 
competitve advantage, is optimised. 
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